Changes summary
Change summary: WLC 4.20 to UXLC 2.2

Contributors to text changes

Alexander Adler Ben Denckla Moshe Escott Yishai Glasner
Moshe Greenberg Shalom Hakkohen Daniel Holman Allan Johnson
Seth (Avi) Kadish Charles Loder Gary Luhovey Yanir Marmor
Jonah Rank Asael Reiter Stephen Salisbury Shmuel Schreiber
Todd Shandelman Graham Thomason David Troidl Shmuel Weissman

How to see specific changes in a given book (i.e. Genesis):

Go to any verse in Genesis and click the blue "Unicode/XML Leningrad Codex [UXLC N.M]" label at the the top center of the page. Look at the table under the "Notes" section of the resulting page that displays all corrections to Genesis. After each chapter:verse.word citation is a description of the change. Clicking on any entry in the "Citation" column brings up the change details for that entry.

Tables of changes:

The root of the UXLC text is the Westminster Leningrad Codex (WLC 4.20, 25 January 2016 ) from the J. Alan Groves Center for Advanced Biblical Research. The most recent Unicode/XML representation of WLC 4.20 is available at this site as the Unicode/XML Leningrad Codex (Build 25.5, 24 Feb 2020).

Changes to the root text are made by one of two applications, never by hand. Changes to individual words are made by the Updater app processing specified Changes files. Bulk text changes are made through the ScanReplace app with specialized modules for each task. The Changes page summarizes the actions taken in each build.

In chronological order

Changes to individual words by Changes files processed by Updater app:

Build(s) Changes
file
UXLC Total Items Consonants Vowels Accents Notes Tags Mergers/
Splits
26.0 2020.02.19 1.0 16 16 0 0 16 0 0 0
26.1 2020.10.19 1.1 20 19 0 5 11 2 2 0
26.2 2021.04.01 1.2 61 49 0 20 24 15 0 2
26.3
26.4
2021.10.19 1.3 52 49 0 1 17 34 0 0
26.5 2022.04.01 1.4 64 52 0 35 13 16 0 0
26.6 2022.07.04 1.5 97 78 0 46 21 30 0 0
26.7 2022.10.19 1.6 118 106 0 33 34 50 0 1
26.8 2022.12.07 1.7 191 151 2 52 75 59 0 3
26.9 2023.04.01 1.8 166 152 2 70 53 39 0 2
27.0 2023.07.04 1.9 173 136 0 37 85 49 0 2
27.1 2023.10.19 2.0 161 135 0 60 46 54 0 0
27.2 2024.04.01 2.1 103 107 0 41 39 23 0 0
27.3 2024.10.19 2.2 138 135 4 38 6 83 0 7
 
Totals: 1360 1185 8 438 440 454 2 17
Total Items Consonants Vowels Accents Notes Tags Mergers/
Splits

A single change Item can result in multiple text changes.

Changes to the overall text by processing with ScanReplace modules:

See the build documentation on the Changes page for details of these actions.

Build(s) Date UXLC Module Purpose
26.1 2020.10.19 1.1 InsertCGJs Adds Combining Grapheme Joiners (CGJs) to Jerusalem and corrects leading meteg words.
26.1
26.2
2020.10.19
2021.04.01
1.1
1.2
WLCEdit Removes most Groves Center transcription notes and all morphological division markers from the text.
26.3
26.4
2021.10.19 1.3 YBYAH Replaces selected yerah ben yomos (YBYs) in Sifrei Emet with atnah hafukhs (AHs).
27.3 2024.10.19 2.2 RemoveNewlines Removes new line characters (\n) from w, k, and q tags. Applied only after all changes have been made by Updater.

Processing details:

WLC 4.20 (25 January 2016) is converted to Unicode/XML via the program WLC2XML and the Documentary Hypothesis (DH) markings are applied by the program DHMarker. This produces the Unicode/XML files for the Unicode/XML Westminster Leningrad Codex (WLC 4.20, Build 25.5) of 24 Feb 2020.

The 3 ScanReplace modules (WLCEdit, InsertCGJs, YBYAH) are applied to the text. They are designed not to interact with each other; that is, processing order has no effect.

Then the Updater Change files are processed in chronological order. To make a change specified by a Change file Updater requires an exact match between the input text and the reference text specified in the Change file; a mismatch causes a program crash identifying the change with the discrepancy. This minimizes erroneous changes. Processing order is critical. Note: a change made in an earlier Change file may be undone by a change in a later Change file.

Processing by Updater may introduce new line (\n) characters into w, k, and q tags. Although these characters are invisible to the display software, they make reading the XML more difficult. The final step in the processing is to apply the ScanReplace module RemoveNewlines to the text to remove these characters.

  27.3